Machine Translation of Bio-Thesauri
نویسندگان
چکیده
In this paper we describe how we applied a generalpurpose machine translation tool for translating biomedical thesauri. We used corresponding terms in parallel corpora to check the validity of the translations. The advantage of this approach is that a single corresponding set of terms can be verified where techniques to retrieve translations from a parallel corpus do not exploit the knowledge contained in current state of the art machine translation software.
منابع مشابه
Cross-lingual RDF Thesauri Interlinking
Various lexical resources are being published in RDF. To enhance the usability of these resources, identical resources in different data sets should be linked. If lexical resources are described in different natural languages, then techniques to deal with multilinguality are required for interlinking. In this paper, we evaluate machine translation for interlinking concepts, i.e., generic entiti...
متن کاملExperiments with the Eurospider Retrieval System for CLEF 2001
Eurospider participated in both the multilingual and monolingual retrieval tasks for CLEF 2001. Our multilingual experiments, the main focus of this year’s work, combine multiple approaches to cross-language retrieval: machine translation, similarity thesauri, and machine-readable dictionaries. We experimented with both query translation and document translation. The monolingual experiments foc...
متن کاملLattice-based Modelling of Thesauri
This paper revisits ideas about the use of lattices as underlying conceptual structures in information retrieval and machine translation as suggested by researchers in the 1950s and 1960s. It describes how these ideas were originally presented, how they are related to each other and how they are represented in modern research, particularly with respect to Formal Concept Analysis.
متن کاملThe Eurospider Retrieval System and the TREC-8 Cross-Language Track
This year the Eurospider team, with help from Columbia, focused on trying different combinations of translation approaches. We investigated the use and integration of pseudo-relevance feedback, multilingual similarity thesauri and machine translation. We also looked at different ways of merging individual crosslanguage retrieval runs to produce multilingual result lists. We participated in both...
متن کاملLeveraging Reusability: Cost-Effective Lexical Acquisition for Large-Scale Ontology Translation
Thesauri and ontologies provide important value in facilitating access to digital archives by representing underlying principles of organization. Translation of such resources into multiple languages is an important component for providing multilingual access. However, the specificity of vocabulary terms in most ontologies precludes fully-automated machine translation using general-domain lexic...
متن کامل